Edited Naive Bayes

نویسندگان

  • José María Martínez-Otzeta
  • Basilio Sierra
  • Elena Lazkano
  • Maier Ardaiz
  • Ekaitz Jauregi
چکیده

Naive Bayes is a well-known and studied algorithm both in statistics and machine learning. Bayesian learning algorithms represent each concept with a single probabilistic summary. This paper presents a variant of the Naive Bayes method, in which the original training set is augmented in the following fashion: Leave-One-Out procedure is applied over the training set, and incorrectly classified instances according to Naive Bayes model are duplicated. The augmented dataset is used to induce the model. The motivation behind this idea is that giving more importance to hard instances (in this case, duplicating them) might contribute to make the model more accurate over that subset of the instance space. We have tested this algorithm over 41 UCI datasets. The results suggest that the chance of obtaining a significant better performance than with the original Naive Bayes approach are much greater than the

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Diagnosis of Pulmonary Tuberculosis Using Artificial Intelligence (Naive Bayes Algorithm)

Background and Aim: Despite the implementation of effective preventive and therapeutic programs, no significant success has been achieved in the reduction of tuberculosis. One of the reasons is the delay in diagnosis. Therefore, the creation of a diagnostic aid system can help to diagnose early Tuberculosis. The purpose of this research was to evaluate the role of the Naive Bayes algorithm as a...

متن کامل

A New Approach for Text Documents Classification with Invasive Weed Optimization and Naive Bayes Classifier

With the fast increase of the documents, using Text Document Classification (TDC) methods has become a crucial matter. This paper presented a hybrid model of Invasive Weed Optimization (IWO) and Naive Bayes (NB) classifier (IWO-NB) for Feature Selection (FS) in order to reduce the big size of features space in TDC. TDC includes different actions such as text processing, feature extraction, form...

متن کامل

In silico prediction of anticancer peptides by TRAINER tool

Cancer is one of the causes of death in the world. Several treatment methods exist against cancer cells such as radiotherapy and chemotherapy. Since traditional methods have side effects on normal cells and are expensive, identification and developing a new method to cancer therapy is very important. Antimicrobial peptides, present in a wide variety of organisms, such as plants, amphibians and ...

متن کامل

پیش بینی میزان آلودگی فلزات سنگین در رسوبات رودخانه گرگانرود با استفاده از داده کاوی

به منظور پیش بینی میزان آلودگی فلزات سنگین در رسوبات رودخانه گرگانرود با استفاده از داده کاوی، در طول رودخانه گرگان رود نمونه های رسوبی در دو فصل (بهار و تابستان) و در 10 ایستگاه با سه تکرار نمونه برداری گردید. پس از آنالیز دستگاهی نمونه ها، داده های خام فلزات سنگین جمع آوری شد. سپس روش پیشنهادی مطرح گردید که شامل مراحل شروع و گردآوری داده ها، پیش پردازش داده ها ، ساخت مدل و همچنین ارزیابی و خر...

متن کامل

Learnability of Augmented Naive Bayes in Nonimal Domains

It is well-known that Naive Bayes can only represent linearly separable functions in binary domains. But the learnability of general Augmented Naive Bayes is open. Little work is done on the learnability of Bayesian networks in nominal domains, a general case of binary domains. This paper explores the learnability of Augmented Naive Bayes in nominal domains. We introduce a complexity measure fo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Inteligencia Artificial, Revista Iberoamericana de Inteligencia Artificial

دوره 10  شماره 

صفحات  -

تاریخ انتشار 2006